System Building Cost vs. Output Quality in Data-to-Text Generation

نویسندگان

  • Anja Belz
  • Eric Kow
چکیده

Data-to-text generation systems tend to be knowledge-based and manually built, which limits their reusability and makes them time and cost-intensive to create and maintain. Methods for automating (part of) the system building process exist, but do such methods risk a loss in output quality? In this paper, we investigate the cost/quality trade-off in generation system building. We compare four new data-to-text systems which were created by predominantly automatic techniques against six existing systems for the same domain which were created by predominantly manual techniques. We evaluate the ten systems using intrinsic automatic metrics and human quality ratings. We find that increasing the degree to which system building is automated does not necessarily result in a reduction in output quality. We find furthermore that standard automatic evaluation metrics underestimate the quality of handcrafted systems and over-estimate the quality of automatically created systems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessing the Trade-Off between System Building Cost and Output Quality in Data-to-Text Generation

Data-to-text generation systems tend to be knowledge-based and manually built, which limits their reusability and makes them time and cost-intensive to create and maintain. Methods for automating (part of) the system building process exist, but do such methods risk a loss in output quality? In this paper, we investigate the cost/quality trade-off in generation system building. We compare six da...

متن کامل

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

Designing an Optimal System of Combined Heat, Cold and Power for a Building

In this study, supply electrical, heating and cooling loads in the building by a separate production system and a cogeneration system, particle swarm optimization algorithm in five different scenarios were studied and analyzed (For example, Data and Information a high-rise building base 72, in the city of Kerman is used). The results show that, cold and heat power system of micro gas turbine wi...

متن کامل

Modeling and sizing optimization of hybrid photovoltaic/wind power generation system

The rapid industrialization and growth of world’s human population have resulted in the unprecedented increase in the demand for energy and in particular electricity. Depletion of fossil fuels and impacts of global warming caused widespread attention using renewable energy sources, especially wind and solar energies. Energy security under varying weather conditions and the corresponding system ...

متن کامل

The Effect of Iranian EFL Learners’ Self-generated vs. Group-generated Text-based Questions on their Reading Comprehension

Reading comprehension is one of the most important skills, especially in the EFL context. One way to improve reading comprehension is through strategy use. The present study aimed at investigating the effect of question-generation strategy on learners' reading comprehension. The participants in the study were 63 intermediate students from three intact groups in Resa institute in Boukan, They we...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009